AITopics | proxy function

Semantic-level watermarking (SWM) for large language models (LLMs) enhances watermarking robustness against text modifications and paraphrasing attacks by treating the sentence as the fundamental unit. However, existing methods still lack strong theoretical guarantees of robustness, and reject-sampling-based generation often introduces significant distribution distortions compared with unwatermarked outputs. In this work, we introduce a new theoretical framework on SWM through the concept of proxy functions (PFs) $\unicode{x2013}$ functions that map sentences to scalar values. Building on this framework, we propose PMark, a simple yet powerful SWM method that estimates the PF median for the next sentence dynamically through sampling while enforcing multiple PF constraints (which we call channels) to strengthen watermark evidence. Equipped with solid theoretical guarantees, PMark achieves the desired distortion-free property and improves the robustness against paraphrasing-style attacks. We also provide an empirically optimized version that further removes the requirement for dynamical median estimation for better sampling efficiency. Experimental results show that PMark consistently outperforms existing SWM baselines in both text quality and robustness, offering a more effective paradigm for detecting machine-generated text. Our code will be released at [this URL](https://github.com/PMark-repo/PMark).

arxiv preprint arxiv, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2509.21057

Country: Asia (0.67)

Genre: Research Report > New Finding (0.65)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

bd391cf5bdc4b63674d6da3edc1bde0d-Paper-Conference.pdf

Neural Information Processing SystemsAug-18-2025, 10:09:44 GMT

artificial intelligence, deep learning, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > Quebec > Montreal (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.93)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Balanced Filtering via Non-Disclosive Proxies

Deng, Siqi, Diana, Emily, Kearns, Michael, Roth, Aaron

arXiv.org Artificial IntelligenceJul-5-2023

We study the problem of non-disclosively collecting a sample of data that is balanced with respect to sensitive groups when group membership is unavailable or prohibited from use at collection time. Specifically, our collection mechanism does not reveal significantly more about group membership of any individual sample than can be ascertained from base rates alone. To do this, we adopt a fairness pipeline perspective, in which a learner can use a small set of labeled data to train a proxy function that can later be used for this filtering task. We then associate the range of the proxy function with sampling probabilities; given a new candidate, we classify it using our proxy function, and then select it for our sample with probability proportional to the sampling probability corresponding to its proxy classification. Importantly, we require that the proxy classification itself not reveal significant information about the sensitive group membership of any individual sample (i.e., it should be sufficiently non-disclosive). We show that under modest algorithmic assumptions, we find such a proxy in a sample- and oracle-efficient manner. Finally, we experimentally evaluate our algorithm and analyze generalization properties.

artificial intelligence, machine learning, proxy, (17 more...)

arXiv.org Artificial Intelligence

2306.15083

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Pennsylvania (0.04)
North America > United States > District of Columbia > Washington (0.04)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.46)

Add feedback

Bootstrapped Training of Score-Conditioned Generator for Offline Design of Biological Sequences

Kim, Minsu, Berto, Federico, Ahn, Sungsoo, Park, Jinkyoo

arXiv.org Artificial IntelligenceJun-5-2023

We study the problem of optimizing biological sequences, e.g., proteins, DNA, and RNA, to maximize a black-box score function that is only evaluated in an offline dataset. We propose a novel solution, bootstrapped training of score-conditioned generator (BootGen) algorithm. Our algorithm repeats a two-stage process. In the first stage, our algorithm trains the biological sequence generator with rank-based weights to enhance the accuracy of sequence generation based on high scores. The subsequent stage involves bootstrapping, which augments the training dataset with self-generated data labeled by a proxy score function. Our key idea is to align the score-based generation with a proxy score function, which distills the knowledge of the proxy score function to the generator. After training, we aggregate samples from multiple bootstrapped generators and proxies to produce a diverse design. Extensive experiments show that our method outperforms competitive baselines on biological sequential design tasks. We provide reproducible source code: \href{https://github.com/kaist-silab/bootgen}{https://github.com/kaist-silab/bootgen}.

artificial intelligence, generator, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2306.03111

Country:

Europe > Austria > Vienna (0.04)
Asia > South Korea > Gyeongsangbuk-do > Pohang (0.04)

Genre:

Research Report > Promising Solution (0.66)
Research Report > New Finding (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Machine Learning in Least-Squares Monte Carlo Proxy Modeling of Life Insurance Companies

Krah, Anne-Sophie, Nikolić, Zoran, Korn, Ralf

arXiv.org Machine LearningSep-4-2019

In order to obtain reasonably accurate full loss distributions via a nested simulations approach as described in Bauer et al. (2012), their cash-flow-projection (CFP) models would need to be simulated several hundred thousand times. But the insurers are currently far from being endowed with sufficient computational capacities to perform such expensive simulation tasks. By applying suitable approximation techniques like the least-squares Monte Carlo (LSMC) approach of Bauer & Ha (2015), the insurers are able to overcome these computational hurdles though. For example, they can implement the LSMC framework formalized by Krah et al. (2018) and applied by e.g. Bettels et al. (2014) to derive their full loss distributions. The central idea of this framework is to carry out a comparably small number of wisely chosen Monte Carlo simulations and to feed the simulation results into a supervised machine learning algorithm that translates the results into a proxy function of the insurer's loss (output) with respect to the underlying risk factors (input). To guarantee a certain approximation quality, the proxy function has to pass an additional validation procedure before it can finally be used for the full loss distribution forecast. Machine Learning Calibration Algorithm Apart from the calibration and validation steps, we adopt the LSMC framework from Krah et al. (2018) without any changes.

artificial intelligence, machine learning, proxy function, (16 more...)

arXiv.org Machine Learning

1909.02182

Country:

Europe > Germany (0.92)
North America > United States (0.67)

Genre:

Research Report > Experimental Study (0.45)
Research Report > New Finding (0.45)

Industry: Banking & Finance > Insurance (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.92)

Add feedback

Generalization Error Bounds for Aggregation by Mirror Descent with Averaging

Juditsky, Anatoli, Nazin, Alexander, Tsybakov, Alexandre, Vayatis, Nicolas

Neural Information Processing SystemsDec-31-2006

For this purpose, we propose a stochastic procedure, the mirror descent, which performs gradient descent in the dual space. The generated estimates are additionally averaged in a recursive fashion with specific weights. Mirror descent algorithms have been developed in different contexts and they are known to be particularly efficient in high dimensional problems. Moreover their implementation is adapted to the online setting. The main result of the paper is the upper bound on the convergence rate for the generalization error.

aggregation, algorithm, descent algorithm, (11 more...)

Neural Information Processing Systems

Country:

Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
Europe > Netherlands > South Holland > Leiden (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.52)

Add feedback

Generalization Error Bounds for Aggregation by Mirror Descent with Averaging

Juditsky, Anatoli, Nazin, Alexander, Tsybakov, Alexandre, Vayatis, Nicolas

Neural Information Processing SystemsDec-31-2006

For this purpose, we propose a stochastic procedure, the mirror descent, which performs gradient descent in the dual space. The generated estimates are additionally averaged in a recursive fashion with specific weights. Mirror descent algorithms have been developed in different contexts and they are known to be particularly efficient in high dimensional problems. Moreover their implementation is adapted to the online setting. The main result of the paper is the upper bound on the convergence rate for the generalization error.

aggregation, algorithm, descent algorithm, (11 more...)

Neural Information Processing Systems

Country:

Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
Europe > Netherlands > South Holland > Leiden (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.52)

Add feedback

Filters

Collaborating Authors

proxy function

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

d601a9b708cacfad167f6c6c45647a18-Paper-Conference.pdf

d601a9b708cacfad167f6c6c45647a18-Paper-Conference.pdf

Bidirectional Learning for Offline Infinite-width Model-based Optimization Can (Sam) Chen

PMark: Towards Robust and Distortion-free Semantic-level Watermarking with Channel Constraints

bd391cf5bdc4b63674d6da3edc1bde0d-Paper-Conference.pdf

Balanced Filtering via Non-Disclosive Proxies

Bootstrapped Training of Score-Conditioned Generator for Offline Design of Biological Sequences

Machine Learning in Least-Squares Monte Carlo Proxy Modeling of Life Insurance Companies

Generalization Error Bounds for Aggregation by Mirror Descent with Averaging

Generalization Error Bounds for Aggregation by Mirror Descent with Averaging